Visualization and manipulation of stereophonic audio signals by means of IID and IPD
نویسندگان
چکیده
In this paper we will discuss a model aimed at improving the spectral data representation of stereophonic audio in a way that allows efficient stereophonic data visualization and linear manipulation of arbitrary parts of the stereo image. The stereo pair is here interpreted as a single spectrum with additional dimensions, expressing the Interaural Intensity Difference (IID) and Interaural Phase Difference (IPD) for each FFT bin. These dimensions are evaluated assuming that the stereo signal is an instantaneous mixture with a residual amount of convolutive phenomena. Even if this assumption is not generally true for the majority of music signals it is applicable to single stems or submixes used during music production or other signals that comes in pairs. After a brief overview of the state of the art in stereo data representation, we will introduce the proposed dimensions, then we will show how they can be displayed and finally we will suggest a technique to manipulate the stereophonic data in realtime.
منابع مشابه
Investigating Ideological Manipulation in Subtitling Based on Farahzad’s CDA Model: A Case Study of The Salesman
Translation plays an important role in conveying and manipulating ideologies. Accordingly, this study sought to analyze the ideological elements in the English subtitles of the Persian movie The Salesman. The framework to find the driven ideological strategies in the translation of the Persian audio of the same movie was based on the critical discourse analysis (...
متن کاملA Recursive Approximation Approach of non-iid Lognormal Random Variables Summation in Cellular Systems
Co-channel interference is a major factor in limiting the capacity and link quality in cellular communications. As the co-channel interference is modeled by lognormal distribution, sum of the co-channel interferences of neighboring cells is represented by the sum of lognormal Random Variables (RVs) which has no closed-form expression. Assuming independent, identically distributed (iid) RVs, the...
متن کاملCombining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کاملStereophonic acoustic echo cancellation system using time-varying all-pass filtering for signal decorrelation
This paper describes a novel technique for decorrelating the stereo signals in stereophonic acoustic echo cancellation (AEC) systems. At present, most teleconferencing systems use a single full-duplex audio channel for voice communications. However, in order to introduce spatial realism, future teleconferencing systems are expected to have more than one channel (at least stereo with two channel...
متن کاملAn Implementation of a Stereophonic Acoustic Echo Canceler on a General Purpose DSP
Teleconferencing systems employ acoustic echo cancelers to reduce echoes that results from the coupling between loudspeaker and microphone. To enhance the sound realism, two-channel audio is necessary. However, stereophonic acoustic echo cancellation is more difficult to solve because of the necessity to uniquely identify two acoustic paths, which becomes problematic since the two excitation si...
متن کامل